(57) Abstract

A method for isolating mRNAs as cDNAs employs a polymerase amplification method
using at least two oligodeoxynucleotide primers. In one approach, the first
primer contains sequence capable of hybridizing to a site immediately upstream
of the first A ribonucleotide of the mRNA's polyA tail and the second primer
contains arbitrary sequence. In another approach, the first primer contains
sequence capable of hybridizing to a site including the mRNA's polyA signal
sequence and the second primer contains arbitrary sequence. In another approach,
the first primer contains arbitrary sequence and the second primer contains
sequence capable of hybridizing to a site including the Kozak sequence. In
another approach, the first primer contains a sequence that is substantially
complementary to the sequence of a mRNA having a known sequence and the second
primer contains arbitrary sequence. In another approach, the first primer
contains arbitrary sequence and the second primer contains sequence that is
substantially identical to the sequence of a mRNA having a known sequence. The
first primer is used as a primer for reverse transcription of the mRNA and the
resultant cDNA is amplified with a polymerase using both the first and second
primers as a primer set.
"Methods to clone mRNA"

Background of the Invention 

This application is a continuation-in-part of the co-pending application U.S.
Serial No. 07/850,343, filed on March 11, 1992.

This invention relates to methods of detecting and cloning individual mRNAs.

The activities of genes in cells are reflected in the kinds and quantities of
their mRNA and protein species. Gene expression is crucial for processes such
as aging, development, differentiation, metabolite production, progression of
the cell cycle, and infectious or genetic or other disease states.
Identification of the expressed mRNAs will be valuable for the elucidation of
their molecular mechanisms, and for applications to the above processes.

Mammalian cells contain approximately 15,000 different mRNA sequences; however,
each mRNA sequence is present at a different frequency within the cell.
Generally, mRNAs are expressed at one of three levels. A few "abundant" mRNAs
are present at about 10,000 copies per cell, about 3,000-4,000 "intermediate"
mRNAs are present at 300-500 copies per cell, and about 11,000 "low-abundance"
or "rare" mRNAs are present at approximately 15 copies per cell. The numerous
genes that are represented by intermediate and low frequencies of their mRNAs
can be cloned by a variety of well established techniques (see for example
Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, Second Edition,
Cold Spring Harbor Press, pp. 8.6-8.35).

If some knowledge of the gene sequence or protein exists, several direct
cloning methods are available. However, if the identity of the desired gene is
unknown, one must be able to select or enrich for the desired gene product in
order to identify the "unknown" gene without expending large amounts of time
and resources.

The identification of unknown genes can often involve the use of
subtractive or differential hybridization techniques. Subtractive hybridization
techniques rely upon the use of very closely related cell populations, such that
differences in gene expression will primarily represent the gene(s) of interest. A



WO 93/18176

PCT/US93/02246

key element of the subtractive hybridization technique is the construction of a
comprehensive complementary-DNA ("cDNA") library.

The construction of a comprehensive cDNA library is now a fairly routine
procedure. PolyA mRNA is prepared from the desired cells and the first strand
of the cDNA is synthesized using RNA-dependent DNA polymerase ("reverse
transcriptase") and an oligodeoxynucleotide primer of 12 to 18 thymidine
residues. The second strand of the cDNA is synthesized by one of several
methods, the more efficient of which are commonly known as "replacement
synthesis" and "primed synthesis".

Replacement synthesis involves the use of ribonuclease H ("RNAase H"),
which cleaves the phosphodiester backbone of RNA that is in an RNA:DNA
hybrid leaving a 3' hydroxyl and a 5' phosphate, to produce nicks and gaps in
the mRNA strand, creating a series of RNA primers that are used by E. coli
DNA polymerase I, or its "Klenow" fragment, to synthesize the second strand of
the cDNA. This reaction is very efficient; however, the cDNAs produced most
often lack the 5' terminus of the mRNA sequence.

Primed synthesis to generate the second cDNA strand is a general name for
several methods which are more difficult than replacement synthesis yet clone the
5' terminal sequences with high efficiency. In general, after the synthesis of the
first cDNA strand, the 3' end of the cDNA strand is extended with terminal
transferase, an enzyme which adds a homopolymeric "tail" of deoxynucleotides,
most commonly deoxycytidylate. This tail is then hybridized to a primer of
oligodeoxyguanidylate or a synthetic fragment of DNA with a deoxyguanidylate
tail and the second strand of the cDNA is synthesized using a DNA-dependent
DNA polymerase.

The primed synthesis method is effective, but the method is laborious, and
all resultant cDNA clones have a tract of deoxyguanidylate immediately upstream
of the mRNA sequence. This deoxyguanidylate tract can interfere with
transcription of the DNA in vitro or in vivo and can interfere with the sequencing
of the clones by the Sanger dideoxynucleotide sequencing method.

Once both cDNA strands have been synthesized, the cDNA library is
constructed by cloning the cDNAs into an appropriate plasmid or viral vector.
In practice this can be done by directly ligating the blunt ends of the cDNAs into
a vector which has been digested by a restriction endonuclease to produce blunt
ends. Blunt end ligations are very inefficient, however, and this is not a
common method of choice. A generally used method involves adding synthetic
linkers or adapters containing restriction endonuclease recognition sequences to
the ends of the cDNAs. The cDNAs can then be cloned into the desired vector
at a greater efficiency.

Once a comprehensive cDNA library is constructed from a cell line,
desired genes can be identified with the assistance of subtractive hybridization
(see for example Sargent T.D., 1987, Meth. Enzymol., Vol. 152, pp. 423-432;
Lee et al., 1991, Proc. Natl. Acad. Sci., USA, Vol. 88, pp. 2825-2830). A
general method for subtractive hybridization is as follows. The complementary
strand of the cDNA is synthesized and radiolabelled. This single strand of
cDNA can be made from polyA mRNA or from the existing cDNA library. The
radiolabelled cDNA is hybridized to a large excess of mRNA from a closely
related cell population. After hybridization the cDNA:mRNA hybrids are
removed from the solution by chromatography on a hydroxylapatite column. The
remaining "subtracted" radiolabelled cDNA can then be used to screen a cDNA
or genomic DNA library of the same cell population.

Subtractive hybridization removes the majority of the genes expressed in
both cell populations and thus enriches for genes which are present only in the
desired cell population. However, if the expression of a particular mRNA
sequence is only a few times more abundant in the desired cell population than
in the subtractive population, it may not be possible to isolate the gene by
subtractive hybridization.

Summary of the Invention 

We have discovered a method for identifying, isolating and cloning
mRNAs as cDNAs using a polymerase amplification method that employs at least
two oligodeoxynucleotide primers. In one approach, the first primer contains
sequence capable of hybridizing to a site including sequence that is immediately
upstream of the first A ribonucleotide of the mRNA's polyA tail and the second
primer contains arbitrary sequence. In another approach, the first primer
contains sequence capable of hybridizing to a site including the mRNA's polyA
signal sequence and the second primer contains arbitrary sequence. In another
approach, the first primer contains arbitrary sequence and the second primer
contains sequence capable of hybridizing to a site including the mRNA's Kozak
sequence. In another approach, the first primer contains a sequence that is
substantially complementary to the sequence of a mRNA having a known
sequence and the second primer contains arbitrary sequence. In another
approach, the first primer contains arbitrary sequence and the second primer
contains sequence that is substantially identical to the sequence of a mRNA
having a known sequence. The first primer is used as a primer for reverse
transcription of the mRNA and the resultant cDNA is amplified with a
polymerase using both the first and second primers as a primer set.

Using this method with different pairs of the alterable primers, virtually
any or all of the mRNAs from any cell type or any stage of the cell cycle,
including very low abundance mRNAs, can be identified and isolated.
Additionally, a comparison of the mRNAs from closely related cells, which may
be for example at different stages of development or different stages of the cell
cycle, can show which of the mRNAs are constitutively expressed and which are
differentially expressed, and their respective frequencies of expression.

The "first primer" or "first oligodeoxynucleotide" as used herein is defined
as being the oligodeoxynucleotide primer that is used for the reverse transcription
of the mRNA to make the first cDNA strand, and then is also used for
amplification of the cDNA. The first primer can also be referred to as the 3'
primer, as this primer will hybridize to the mRNA and will define the 3' end of
the first cDNA strand. The "second primer" as used herein is defined as being
the oligodeoxynucleotide primer that is used to make the second cDNA strand,
and is also used for the amplification of the cDNA. The second primer may also
be referred to as the 5' primer, as this primer will hybridize to the first cDNA
strand and will define the 5' end of the second cDNA strand.

The "arbitrary" sequence of an oligodeoxynucleotide primer as used herein
is defined as being based upon or subject to individual judgement or discretion.
In some instances, the arbitrary sequence can be entirely random or partly
random for one or more bases. In other instances the arbitrary sequence can be
selected to contain a specific ratio of each deoxynucleotide, for example
approximately equal proportions of each deoxynucleotide or predominantly one
deoxynucleotide, or to not contain a specific deoxynucleotide. The arbitrary
sequence can be selected to contain, or not to contain, a recognition site for a
specific restriction endonuclease. The arbitrary sequence can be selected to
either contain a sequence that is substantially identical (at least 50% homologous)
to a mRNA of known sequence or to not contain sequence from a mRNA of
known sequence.

An oligodeoxynucleotide primer can be either "complementary" to a
sequence or "substantially identical" to a sequence. As defined herein, a
complementary oligodeoxynucleotide primer is a primer that contains a sequence
which will hybridize to an mRNA, that is, the bases are complementary to each
other and a reverse transcriptase will be able to extend the primer to form a
cDNA strand of the mRNA. As defined herein, a substantially identical primer
is a primer that contains sequence which is the same as the sequence of an
mRNA, that is, greater than 50% identical, and the primer has the same
orientation as an mRNA; thus it will not hybridize to, or complement, an mRNA,
but such a primer can be used to hybridize to the first cDNA strand and can be
extended by a polymerase to generate the second cDNA strand. The terms of art
"hybridization" or "hybridize", as used herein, are defined to be the base pairing
of an oligodeoxynucleotide primer with a mRNA or cDNA strand. The
"conditions under which" an oligodeoxynucleotide hybridizes with an mRNA or
a cDNA, as used herein, are defined to be temperature and buffer conditions (that
are described later) under which the base pairing of the oligodeoxynucleotide
primer with either an mRNA or a cDNA occurs and only a few mismatches (one
or two) of the base pairing are permissible.
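
The distinction between a "complementary" and a "substantially identical" primer can be illustrated with a minimal Python sketch. The function names and the 10-base example site are hypothetical and chosen only for illustration; the greater-than-50% identity threshold is the one stated in the definition above.

```python
# Illustrative helpers; names and the example site are not taken from the patent.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def percent_identity(primer: str, target: str) -> float:
    """Percentage of positions at which two equal-length sequences match."""
    if len(primer) != len(target):
        raise ValueError("sequences must have the same length")
    matches = sum(a == b for a, b in zip(primer.upper(), target.upper()))
    return 100.0 * matches / len(primer)


def is_substantially_identical(primer: str, target: str) -> bool:
    """Apply the 'greater than 50% identical' criterion from the definition."""
    return percent_identity(primer, target) > 50.0


# Hypothetical 10-base site of an mRNA, written as DNA (T in place of U).
mrna_site = "AGCTTGACCA"

# A complementary (first) primer base-pairs with the mRNA site.
first_primer = reverse_complement(mrna_site)

# A substantially identical (second) primer has the same orientation as the
# mRNA, so it pairs with the first cDNA strand rather than with the mRNA.
second_primer = "AGCTTGACGG"  # 8 of 10 positions match the mRNA site
```

In this sketch the first primer is derived from the mRNA site by reverse complementation, while the second primer is compared to the mRNA site directly, which mirrors the orientation argument made in the definitions.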

An oligonucleotide primer can contain a sequence that is known to be a
"consensus sequence" of an mRNA of known sequence. As defined herein, a
"consensus sequence" is a sequence that has been found in a gene family of
proteins having a similar function or similar properties. The use of a primer that
includes a consensus sequence may result in the cloning of additional members of
a desired gene family.

The "preferred length" of an oligodeoxynucleotide primer, as used herein,
is determined from the desired specificity of annealing and the number of
oligodeoxynucleotides having the desired specificity that are required to hybridize
to all the mRNAs in a cell. An oligodeoxynucleotide primer of 20 nucleotides is
more specific than an oligodeoxynucleotide primer of 10 nucleotides; however,
addition of each random nucleotide to an oligodeoxynucleotide primer increases
fourfold the number of oligodeoxynucleotide primers required in order to
hybridize to every mRNA in a cell.

In one aspect, in general, the invention features a method for identifying
and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer that contains sequence
capable of hybridizing to a site including sequence that is immediately upstream
of the first A ribonucleotide of the mRNA's polyA tail, and amplifying the
cDNA by a polymerase amplification method using the first primer and a second
oligodeoxynucleotide primer, for example a primer having arbitrary sequence, as
a primer set.

In preferred embodiments, the first primer contains at least 1 nucleotide at
the 3' end of the oligodeoxynucleotide that can hybridize to an mRNA sequence
that is immediately upstream of the polyA tail, and contains at least 11
nucleotides at the 5' end that will hybridize to the polyA tail. The entire 3'
oligodeoxynucleotide is preferably at least 13 nucleotides in length, and can be
up to 20 nucleotides in length.

Most preferably, the first primer contains 2 nucleotides at the 3' end of the
oligodeoxynucleotide that can hybridize to an mRNA sequence that is
immediately upstream of the polyA tail. Preferably, the 2
polyA-non-complementary nucleotides are of the sequence VN, where V is
deoxyadenylate ("dA"), deoxyguanylate ("dG"), or deoxycytidylate ("dC"), and N,
the 3' terminal nucleotide, is dA, dG, dC, or deoxythymidylate ("dT"). Thus the
sequence of a preferred first primer is 5'-TTTTTTTTTTTVN [Seq. ID. No. 1].
The use of 2 nucleotides can provide accurate positioning of the first primer at
the junction between the mRNA and its polyA tail, as the properly aligned
oligodeoxynucleotide:mRNA hybrids are more stable than improperly aligned
hybrids, and thus the properly aligned hybrids will form and remain hybridized
at higher temperatures. In preferred applications, the mRNA sample will be
divided into at least twelve aliquots and one of the 12 possible VN sequences of
the first primer will be used in each reaction to prime the reverse transcription of
the mRNA. The use of an oligodeoxynucleotide with a single sequence will
reduce the number of mRNAs to be analyzed in each sample by binding to a
subset of the mRNAs, statistically 1/12th, thus simplifying the identification of
the mRNAs in each sample.
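
The twelve VN-anchored first primers follow directly from the definitions of V (dA, dG or dC) and N (any of the four deoxynucleotides). The following Python sketch enumerates them, assuming an oligo-dT run of 11 residues as in the preferred embodiment; the function name is hypothetical.

```python
from itertools import product


def anchored_primers(n_t: int = 11) -> list[str]:
    """Enumerate oligo-dT primers with a two-base VN anchor at the 3' end.

    V is any base except T, and N is any of the four bases, so the result
    always contains 3 * 4 = 12 primers.
    """
    v_bases = "ACG"   # V: dA, dC or dG
    n_bases = "ACGT"  # N: dA, dC, dG or dT
    return ["T" * n_t + v + n for v, n in product(v_bases, n_bases)]


primers = anchored_primers()
for index, primer in enumerate(primers, start=1):
    # One primer per aliquot of the mRNA sample.
    print(f"aliquot {index:2d}: 5'-{primer}-3'")
```

Each printed primer would be used in its own reverse transcription reaction, so that each aliquot samples, statistically, one twelfth of the mRNA population.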

In some embodiments, the 3' end of the first primer can have 1 nucleotide
that can hybridize to an mRNA sequence that is immediately upstream of the
polyA tail, and 12 nucleotides at the 5' end that will hybridize to the polyA tail;
thus the primer will have the sequence 5'-TTTTTTTTTTTTV [Seq. ID. No. 2].
The use of a single non-polyA-complementary deoxynucleotide would decrease
the number of oligodeoxynucleotides that are required to identify every mRNA to
3; however, the use of a single nucleotide to position the annealing of the primer
to the junction of the mRNA sequence and the polyA tail may result in a
significant loss of specificity of the annealing, and 2 non-polyA-complementary
nucleotides are preferred.

In some embodiments, the 3' end of the first primer can have 3 or more
nucleotides that can hybridize to an mRNA sequence that is immediately
upstream of the polyA tail. The addition of each nucleotide to the 3' end will
further increase the stability of properly aligned hybrids, and the sequence to
hybridize to the polyA tail can be decreased by one nucleotide for each additional
non-polyA-complementary nucleotide added. The use of such a first primer may
not be practical for rapid screening of the mRNAs contained within a given cell
line, as the use of a first primer with more than 2 nucleotides that hybridize to
the mRNA immediately upstream of the polyA tail significantly increases the
number of oligodeoxynucleotides required to identify every mRNA. For
instance, an oligo-dT primer with the 3' terminal sequence VNN [Seq. ID. No. 3]
would require the use of 48 separate first primers in order to bind to every mRNA,
and would significantly increase the number of reactions required to screen the
mRNA from a given cell line. The use of oligodeoxynucleotides with a single random
nucleotide in one position as a group of four can circumvent the problem of
needing to set up 48 separate reactions in order to identify every mRNA.
However, as the non-polyA-complementary sequence becomes longer, it would
quickly become necessary to increase the number of reactions required to identify
every mRNA.
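
The primer counts quoted above (3 for a V anchor, 12 for VN, 48 for VNN) follow from one V position with three choices and any further N positions with four choices each. The Python sketch below reproduces that arithmetic and shows how pooling primers that differ at one N position into groups of four reduces the number of reactions; the function names are hypothetical.

```python
def primers_needed(anchor_len: int) -> int:
    """Number of distinct anchored primers for an anchor of the form V, VN, VNN, ...

    The first anchor base (V) has 3 choices; each further base (N) has 4.
    """
    if anchor_len < 1:
        raise ValueError("anchor must contain at least the V position")
    return 3 * 4 ** (anchor_len - 1)


def reactions_needed(anchor_len: int, pooled_positions: int = 0) -> int:
    """Number of separate reactions when some N positions are pooled.

    Pooling one N position combines four primers into a single reaction.
    Only N positions can be pooled, so at most anchor_len - 1 of them.
    """
    if not 0 <= pooled_positions < anchor_len:
        raise ValueError("pooled_positions must be between 0 and anchor_len - 1")
    return primers_needed(anchor_len) // 4 ** pooled_positions


for length in (1, 2, 3, 4):
    print(length, primers_needed(length), reactions_needed(length, min(1, length - 1)))
```

Under these assumptions, a VNN anchor with one pooled N position needs 12 reactions, the same number as the unpooled VN design, while each further anchor base again multiplies the count by four.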

In preferred embodiments, the second primer is of arbitrary sequence and
is at least 9 nucleotides in length. Preferably the second primer is at most 13
nucleotides in length and can be up to 20 nucleotides in length.

In another aspect, in general, the invention features a method for preparing
and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first primer that contains a sequence capable of hybridizing
to the polyadenylation signal sequence and at least 4 nucleotides that are
positioned 5', or 3', or both of the polyadenylation signal sequence; this entire
first primer is preferably at least 10 nucleotides in length, and can be up to 20
nucleotides in length. In one preferred embodiment the sequence
5'-NNTTTATTNN [Seq. ID. No. 4] can be chosen such that the sequence is
5'-GCTTTATTNC [Seq. ID. No. 5], and the four resultant primers are used
together in a single reaction for the priming of the mRNA for reverse
transcription. Once the first cDNA strand has been formed by reverse
transcription then the first primer can be used with a second primer, for example
an arbitrary sequence primer, for the amplification of the cDNA.
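
As an illustration of why a primer of the form 5'-GCTTTATTNC yields four primers that all target the polyadenylation signal, the following Python sketch expands the single N position and checks that each primer's reverse complement contains AATAAA (the DNA form of the AAUAAA signal). It assumes the primer sequence reads GCTTTATTNC; the function names are hypothetical.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(dna.upper()))


def expand_n(primer: str) -> list[str]:
    """Expand every N in a primer into the four concrete bases."""
    results = [""]
    for base in primer.upper():
        choices = "ACGT" if base == "N" else base
        results = [prefix + choice for prefix in results for choice in choices]
    return results


POLYA_SIGNAL = "AATAAA"  # DNA form of the AAUAAA polyadenylation signal

signal_primers = expand_n("GCTTTATTNC")
for primer in signal_primers:
    target = reverse_complement(primer)
    # Each primer pairs with an mRNA site that includes the signal hexamer.
    print(primer, "pairs with", target, POLYA_SIGNAL in target)
```

The four expanded primers differ only at the N position, which corresponds to the base immediately 5' of the signal in the mRNA, so pooling them in one reaction covers all four possibilities at that position.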

In one aspect, in general, the invention features a method for identifying
and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer to generate a first cDNA
strand, and priming the preparation of the second cDNA strand with a second
primer that contains sequence substantially identical to the Kozak sequence of
mRNA, and amplifying the cDNA by a polymerase amplification method using
the first and second primers as a primer set.

In preferred embodiments, the first and second primers are at least 9
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be
up to 20 nucleotides in length. Most preferably the first and second primers are
10 deoxynucleotides in length.

In preferred embodiments the sequence of the first primer is selected at
random, or the first primer contains a selected arbitrary sequence, or the first
primer contains a restriction endonuclease recognition sequence.

In preferred embodiments the second primer that contains
sequence substantially identical to the Kozak sequence of mRNA has the
sequence NNNANNATGN [Seq. ID No. 6], or has the sequence
NNNANNATGG [Seq. ID No. 7], where N is any of the four
deoxynucleotides. Preferably, the second primer has the sequence
GCCACCATGG [Seq. ID No. 8]. In some embodiments the first primer may
further include a restriction endonuclease recognition sequence that is added to
either the 5' or 3' end of the primer, increasing the length of the primer by at
least 5 nucleotides.
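
The degenerate Kozak primers can be read as patterns in which N matches any deoxynucleotide. The Python sketch below checks a concrete sequence against such a pattern and scans a template for matching sites; it assumes the patterns read NNNANNATGN and NNNANNATGG, and the function names and example template are hypothetical.

```python
def matches_pattern(pattern: str, sequence: str) -> bool:
    """True if `sequence` fits `pattern`, where N in the pattern matches any base."""
    if len(pattern) != len(sequence):
        return False
    return all(p == "N" or p == s
               for p, s in zip(pattern.upper(), sequence.upper()))


def find_sites(pattern: str, template: str) -> list[int]:
    """Return 0-based start positions where the pattern matches the template."""
    width = len(pattern)
    return [start
            for start in range(len(template) - width + 1)
            if matches_pattern(pattern, template[start:start + width])]


KOZAK_GENERAL = "NNNANNATGN"   # Seq. ID No. 6 as read here
KOZAK_WITH_G = "NNNANNATGG"    # Seq. ID No. 7 as read here
KOZAK_PREFERRED = "GCCACCATGG"  # Seq. ID No. 8

# Hypothetical stretch of second-strand sequence around a start codon.
template = "TTGCCACCATGGCT"
print(find_sites(KOZAK_GENERAL, template))
```

The preferred primer GCCACCATGG is one concrete instance of both degenerate patterns, which is why it is expected to prime at a subset of the sites the degenerate primers would recognize.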

In another aspect, in general, the invention features a method for
identifying and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer that contains sequence that
is substantially complementary to the sequence of a mRNA having a known
sequence, and priming the preparation of the second cDNA strand with a second
primer, and amplifying the cDNA by a polymerase amplification method using
the first and second primers as a primer set.

In preferred embodiments, the first and second primers are at least 9
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be
up to 20 nucleotides in length. Most preferably the first and second primers are
10 deoxynucleotides in length.

In preferred embodiments the sequence of the first primer further includes
a restriction endonuclease sequence, which may be included within the preferred
10 nucleotides of the primer or may be added to either the 3' or 5' end of the
primer, increasing the length of the oligodeoxynucleotide primer by at least 5
nucleotides.
In preferred embodiments the sequence of the second primer is selected at
random, or the second primer contains a selected arbitrary sequence, or the
second primer contains a restriction endonuclease recognition sequence.

In another aspect, in general, the invention features a method for
identifying and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer, and priming the
preparation of the second cDNA strand with a second primer that contains
sequence that is substantially identical to the sequence of a mRNA having a
known sequence, and amplifying the cDNA by a polymerase amplification
method using the first and second primers as a primer set.

In preferred embodiments, the first and second primers are at least 9 
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be 
up to 20 nucleotides in length. Most preferably the first and second primers are 
10 deoxynucleotides in length. 

In preferred embodiments the sequence of the first primer is selected at
random, or the first primer contains a selected arbitrary sequence, or the first
primer contains a restriction endonuclease recognition sequence.

In preferred embodiments the sequence of the second primer having a
sequence that is substantially complementary to the sequence of an mRNA having
a known sequence further includes a restriction endonuclease sequence, which
may be included within the preferred 10 nucleotides of the primer or may be
added to either the 3' or 5' end of the primer, increasing the length of the
oligodeoxynucleotide primer by at least 5 nucleotides.

In another aspect, in general, the invention features a method for
identifying and isolating mRNAs by priming a preparation of mRNA for reverse
transcription with a first oligodeoxynucleotide primer that contains sequence that
is substantially complementary to the sequence of a mRNA having a known
sequence, and priming the preparation of the second cDNA strand with a second
primer that contains sequence that is substantially identical to the Kozak sequence
of mRNA, and amplifying the cDNA by a polymerase amplification method
using the first and second primers as a primer set.
In preferred embodiments, the first and second primers are at least 9
deoxynucleotides in length, and are at most 13 nucleotides in length, and can be 
up to 20 nucleotides in length. Most preferably the first and second primers are 
10 deoxynucleotides in length. 

In some preferred embodiments of each of the general aspects of the 
invention, the amplified cDNAs are separated and then the desired cDNAs are 
reamplified using a polymerase amplification reaction and the first and second 
oligodeoxynucleotide primers. 

In preferred embodiments of each of the general aspects of the invention, a
set of first and second oligodeoxynucleotide primers can be used, consisting of
more than one of each primer. In some embodiments more than one of the first
primer will be included in the reverse transcription reaction and more than one
each of the first and second primers will be included in the amplification
reactions. The use of more than one of each primer will increase the number of
mRNAs identified in each reaction, and the total number of primers to be used
will be determined based upon the desired method of separating the cDNAs such
that it remains possible to fully isolate each individual cDNA. In preferred
embodiments a few hundred cDNAs can be isolated and identified using
denaturing polyacrylamide gel electrophoresis.

The method according to the invention is a significant advance over current
cloning techniques that utilize subtractive hybridization. In one aspect, the
method according to the invention enables the genes which are altered in their
frequency of expression, as well as mRNAs which are constitutively and
differentially expressed, to be identified by simple visual inspection and isolated.
In another aspect the method according to the invention provides specific
oligodeoxynucleotide primers for amplification of the desired mRNA as cDNA
and makes unnecessary an intermediary step of adding a homopolymeric tail to
the first cDNA strand for priming of the second cDNA strand, thereby
avoiding any interference from the homopolymeric tail with subsequent analysis
of the isolated gene and its product. In another aspect the method according to
the invention allows the cloning and sequencing of selected mRNAs, so that the
investigator may determine the relative desirability of the gene prior to screening
a comprehensive cDNA library for the full length gene product.

Description of the Preferred Embodiments 

Drawings 

Fig. 1 is a schematic representation of the method according to the
invention.

Fig. 2 is the sequence of the 3' end of the N1 gene from normal mouse
fibroblast cells (A31) [Seq. ID. No. 9].

Fig. 3 is the Northern blot of the N1 sequence on total cellular RNA from
normal and tumorigenic mouse fibroblast cells.

Fig. 4 is a sequencing gel showing the results of amplification for mRNA
prepared from four sources (lanes 1-4), using the Kozak primer alone, the AP-1
primer alone, the Kozak and AP-1 primers, the Kozak and AP-2 primers, the
Kozak and AP-3 primers, the Kozak and AP-4 primers and the Kozak and AP-5
primers. This gel will be more fully described later.

Fig. 5 is a partial sequence of the 5' end of a clone, K1, that was cloned
from the A1-5 cell line that was cultured at the non-permissive temperature and
then shifted to the permissive temperature (32.5°C) for 24 h prior to the
preparation of the mRNA. The A1-5 cell line is from a primary rat embryo
fibroblast cell line that has been doubly transformed with ras and a temperature
sensitive mutation of P53 ("P53ts").

General Description, Development of the Method

By way of illustration a description of examples of the method of the
invention follows, with a description by way of guidance of how the particular
illustrative examples were developed.

It is important for operation of the method that the length of the
oligodeoxynucleotide be appropriate for specific hybridization to mRNA. In
order to obtain specific hybridization, whether for conventional cloning methods
or PCR, oligodeoxynucleotides are usually chosen to be 20 or more nucleotides
in length. The use of long oligodeoxynucleotides in this instance would decrease
the number of mRNAs identified during each trial and would greatly increase the
number of oligodeoxynucleotides required to identify every mRNA. Recently, it
was demonstrated that 9-10 nucleotide primers can be used for DNA
polymorphism analysis by PCR (Williams et al., 1991, Nuc. Acids Res., Vol.
18, pp. 6531-6535).

The plasmid containing the cloned murine thymidine kinase gene ("TK
cDNA plasmid") was used as a model template to determine the required lengths
of oligodeoxynucleotides for specific hybridization to a mRNA, and for the
production of specific PCR products. The oligodeoxynucleotide primer chosen to
hybridize internally in the mRNA was varied between 6 and 13 nucleotides in
length, and the oligodeoxynucleotide primer chosen to hybridize at the upstream
end of the polyA tail was varied between 7 and 14 nucleotides in length. After
numerous trials with different sets and lengths of primers, it was determined that
an annealing temperature of 42°C is optimal for product specificity, that the
internally hybridizing oligodeoxynucleotide should be at least 9 nucleotides in
length, and that an oligodeoxynucleotide at least 13 nucleotides in length is
required to bind to the upstream end of the polyA tail.

With reference now to Fig. 1, the method according to the invention is
depicted schematically. The mRNAs are mixed with the first primer, for
example TTTTTTTTTTTVN [Seq. ID. No. 2] (T11VN) 1, and reverse
transcribed 2 to make the first cDNA strand. The cDNA is amplified as follows.
The first cDNA strand is added to the second primer and the first primer and the
polymerase in the standard buffer with the appropriate concentrations of
nucleotides, and the components are heated to 94°C to denature the mRNA:cDNA
hybrid 3, the temperature is reduced to 42°C to allow the second primer to
anneal 4, and then the temperature is increased to 72°C to allow the polymerase
to extend the second primer 5. The cycling of the temperature is then repeated
6, 7, 8, to begin the amplification of the sequences which are hybridized by the
first and second primers. The temperature is cycled until the desired number of
copies of each sequence have been made.
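
The temperature cycle described above can be written down as a small data structure. The Python sketch below is only an illustration: the three temperatures come from the description of Fig. 1, the hold times and the count of 40 cycles are those of the PCR parameters reported later in this text, and the class and function names are hypothetical. The doubling calculation is an idealized upper bound, not a claim about actual yield.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    """One temperature hold within a thermal cycle."""
    name: str
    temp_c: float
    seconds: int


# Denature, anneal, extend, in the order described for Fig. 1.
CYCLE = (
    Step("denature", 94.0, 30),
    Step("anneal", 42.0, 60),
    Step("extend", 72.0, 30),
)
N_CYCLES = 40


def total_hold_seconds(cycle=CYCLE, n_cycles: int = N_CYCLES) -> int:
    """Total programmed hold time, ignoring ramping between temperatures."""
    return n_cycles * sum(step.seconds for step in cycle)


def max_fold_amplification(n_cycles: int) -> int:
    """Upper bound on amplification if every template doubled in every cycle."""
    return 2 ** n_cycles


print(total_hold_seconds() / 60, "minutes of hold time")
print(max_fold_amplification(10), "fold after 10 ideal cycles")
```

With these assumed parameters the programmed hold time is 80 minutes; real runs take longer because of temperature ramping, and real amplification falls short of perfect doubling.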

As is well known in the art, this amplification method can be accomplished
using a thermostable polymerase or a polymerase that is not thermostable.
When a polymerase that is not thermostable is used, fresh polymerase must be
added after the annealing of the primers to the templates at the start of the
elongation or extending step, and the extension step must be carried out at a
temperature that is permissible for the chosen polymerase.

The following examples of the method of the invention are presented for
illustrative purposes only. As will be appreciated, the method according to the
invention can be used for the isolation of polyA mRNA from any source and can
be used to isolate genes expressed either differentially or constitutively at any
level, from rare to abundant.

Example 1

Experimentation with the conditions required for accurate and reproducible
results by PCR was conducted with the TK cDNA plasmid and a single set of
oligodeoxynucleotide primers; the sequence TTTTTTTTTTTCA ("T11CA") [Seq.
ID. No. 10] was chosen to hybridize to the upstream end of the polyA tail and
the sequence CTTGATTGCC ("Ltk3") [Seq. ID. No. 11] was chosen to
hybridize 288 base pairs ("bp") upstream of the polyA tail. The expected
fragment size using these two primers is 299 bp.
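
The stated fragment size can be reproduced with simple arithmetic. The sketch
below is illustrative only: it assumes that the 288 bp are counted from the 5'
end of the Ltk3 site to the last base before the polyA tail, and that the
product extends into the tail by the length of the T run of the anchored
primer. The variable names are hypothetical and not part of the disclosure.

```python
# Illustrative arithmetic for the expected TK amplicon (values from the text).
T11CA = "TTTTTTTTTTTCA"   # anchored primer, Seq. ID. No. 10
LTK3 = "CTTGATTGCC"       # internal primer, Seq. ID. No. 11

ltk3_offset_bp = 288      # Ltk3 site lies 288 bp upstream of the polyA tail

# Assumption: the 3' "CA" of T11CA pairs with the two bases just upstream of
# the tail (already inside the 288 bp); only the T run adds to the product.
t_run = len(T11CA) - len(T11CA.lstrip("T"))

expected_bp = ltk3_offset_bp + t_run
print(t_run, expected_bp)
```

Under these assumptions the T run contributes 11 bp, which agrees with the
299 bp given above.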

PCR was conducted under standard buffer conditions well known in the art
with 10 ng TK cDNA plasmid (buffer and polymerase are available from Perkin
Elmer-Cetus). The standard conditions were altered in that the primers were
used at concentrations of 2.5 µM T11CA [Seq. ID. No. 10] and 0.5 µM Ltk3 [Seq.
ID. No. 11], instead of 1 µM of each primer. The concentration of the
nucleotides ("dNTPs") was also varied over a 100 fold range, from the standard
200 µM to 2 µM. The PCR parameters were 40 cycles of a denaturing step for
30 seconds at 94°C, an annealing step for 1 minute at 42°C, and an extension
step for 30 seconds at 72°C. Significant amounts of non-specific PCR products
were observed when the dNTP concentration was 200 µM, whereas concentrations
of dNTPs at or below 20 µM yielded specifically amplified PCR products. The
specificity of the PCR products was verified by restriction endonuclease digest of
the amplified DNA, which yielded the expected sizes of restriction fragments. In
some instances it was found that the use of up to 5 fold more of the first primer
than the second primer also functioned to increase the specificity of the product.
Lowering the dNTP concentration to 2 µM allowed the labelling of the PCR
products to a high specific activity with [α-35S] dATP (0.5 µM [α-35S] dATP,
Sp. Act. 1200 Ci/mmol), which is necessary for distinguishing the PCR products
when resolved by high resolution denaturing polyacrylamide gel electrophoresis,
in this case a DNA sequencing gel.
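
For orientation, the cycling program and the dNTP titration described above
can be written down as data. This is a minimal sketch; the names are
hypothetical, and the time calculation ignores temperature ramps, which depend
on the instrument and are not stated in the text.

```python
# PCR program from the text: 40 cycles of 94 C / 30 s, 42 C / 60 s, 72 C / 30 s.
CYCLES = 40
STEPS = [("denature", 94, 30), ("anneal", 42, 60), ("extend", 72, 30)]  # (name, deg C, s)

# Total time spent at the programmed temperatures (ramp times not included).
hold_seconds = CYCLES * sum(seconds for _, _, seconds in STEPS)
print(f"hold time: {hold_seconds / 60:.0f} min")

# dNTP titration: from the standard 200 uM down to 2 uM.
dntp_fold_range = 200 / 2
print(f"dNTP range: {dntp_fold_range:.0f}-fold")
```
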
Example 2

The PCR method of amplification with short oligodeoxynucleotide primers
was then used to detect a subset of mRNAs in mammalian cells. Total RNAs
and mRNAs were prepared from mouse fibroblast cells which were either
growing normally ("cycling") or serum starved ("quiescent"). The RNAs and
mRNAs were reverse transcribed with T11CA [Seq. ID. No. 10] as the primer.
The T11CA primer [Seq. ID. No. 10] was annealed to the mRNA by heating the
mRNA and primer together to 65°C and allowing the mixture to gradually cool
to 35°C. The reverse transcription reaction was carried out with Moloney
murine leukemia virus reverse transcriptase at 35°C. The resultant cDNAs were
amplified by PCR in the presence of T11CA [Seq. ID. No. 10] and Ltk3 [Seq.
ID. No. 11], as described in Example 1, using 2 µM dNTPs. The use of the
T11CA [Seq. ID. No. 10] and Ltk3 [Seq. ID. No. 11] primers allowed the TK
mRNA to be used as an internal control for differential expression of a rare
mRNA transcript; TK mRNA is present at approximately 30 copies per cell.
The DNA sequencing gel revealed 50 to 100 amplified mRNAs in the size range
which is optimal for further analysis, between 100 and 500 nucleotides. The
patterns of the mRNA species observed in cycling and quiescent cells were very
similar, as expected, though some differences were apparent. Notably, the TK
gene mRNA, which is expressed during G1 and S phase, was found only in the
RNA preparations from cycling cells, as expected, thus demonstrating the ability
of this method to separate and isolate rare mRNA species such as TK.
Example 3

The expression of mRNAs in normal and tumorigenic mouse fibroblast
cells was also compared using the T11CA [Seq. ID. No. 10] and Ltk3 [Seq. ID.
No. 11] primers for the PCR amplification. The mRNA was reverse transcribed
using T11CA [Seq. ID. No. 10] as the primer and the resultant cDNA was
amplified by PCR using 2 µM dNTPs and the PCR parameters described above.
The PCR products were separated on a DNA sequencing gel. The TK mRNA
was present at the same level in both the normal and tumorigenic mRNA
preparations, as expected, and provided a good internal control to demonstrate
the representation of rare mRNA species. Several other bands were present in
one preparation and not in the other, with a few bands present only in the mRNA
from normal cells and a few bands present only in the mRNA from the
tumorigenic cells; and some bands were expressed at different levels in the
normal and tumorigenic cells. Thus, the method according to the invention can
be used to identify genes which are normally continuously expressed
(constitutive), and differentially expressed, suppressed, or otherwise altered in
their level of expression.

Cloning of the mRNA identified in Example 3

Three cDNAs, namely the TK cDNA, one cDNA expressed only in
normal cells ("N1"), and one cDNA expressed only in tumorigenic cells ("T1"),
were recovered from the DNA sequencing gel by electroelution, ethanol
precipitated to remove the urea and other contaminants, and reamplified by PCR,
in two consecutive PCR amplifications of 40 cycles each, with the primers T11CA
[Seq. ID. No. 10] and Ltk3 [Seq. ID. No. 11] in the presence of 20 µM dNTPs
to achieve optimal yield without compromising the specificity. The reamplified
PCR products were confirmed to have the appropriate sizes and primer
dependencies. As an additional control, the reamplified TK cDNA was digested
with two separate restriction endonucleases and the digestion products were also
confirmed to be of the correct size.

The reamplified N1 [Seq. ID. No. 9] was cloned with the TA cloning
system, Invitrogen Inc., into the plasmid pCR1000 and sequenced. With
reference now to Fig. 2, the nucleotide sequence clearly shows the N1 fragment
[Seq. ID. No. 9] to be flanked by the underlined Ltk3 primer 15 at the 5' end
and the underlined T11CA primer 16 at the 3' end as expected.
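
The flanking described for Fig. 2 can be checked against the ends of Seq. ID.
No. 9 as given in the sequence listing. The following is a small sketch with
hypothetical helper and variable names; only the first and last 20 nucleotides
of the listed sequence are used.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

LTK3 = "CTTGATTGCC"        # Seq. ID. No. 11
T11CA = "TTTTTTTTTTTCA"    # Seq. ID. No. 10

n1_5prime = "CTTGATTGCC" + "TCCTACAGCA"   # first 20 nt of Seq. ID. No. 9
n1_3prime = "TCCTGGGTGA" + "AAAAAAAAAA"   # last 20 nt of Seq. ID. No. 9

# The sense strand should begin with Ltk3 and end with the site that T11CA
# anneals to, i.e. the reverse complement of the anchored primer.
starts_with_ltk3 = n1_5prime.startswith(LTK3)
ends_with_t11ca_site = n1_3prime.endswith(reverse_complement(T11CA))
print(starts_with_ltk3, ends_with_t11ca_site)
```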

A Northern analysis of total cellular RNA using a radiolabelled N1 probe
reconfirmed that the N1 mRNA was only present in the normal mouse fibroblast
cells, and not in the tumorigenic mouse fibroblast cells. With reference now to
Fig. 3, the probe used to detect the mRNA is labelled to the right of the figure,
and the size of the N1 mRNA can be estimated from the 28S and 18S markers
depicted to the left of the figure. The N1 mRNA is present at low abundance in
both exponentially growing and quiescent normal cells, lanes 1 and 3, and is
absent from both exponentially growing and quiescent tumorigenic cells, lanes 2
and 4. As a control, the same Northern blot was reprobed with a radiolabelled
probe for 36B4, a gene that is expressed in both normal and tumorigenic cells, to
demonstrate that equal amounts of mRNA, lanes 1-4, were present on the
Northern blot.
Example 4 

The expression of mRNAs was compared in three cell lines, one of
which was tested after culturing under two different conditions.
The cell lines were a primary rat embryo fibroblast cell line ("REF"), the REF
cell line that has been doubly transformed with ras and a mutant of p53
("T101-4"), and the REF cell line that has been doubly transformed with ras and a
temperature sensitive mutation of p53 ("A1-5"). The A1-5 cell line was cultured
at the non-permissive temperature of 37°C, and also cultured at 37°C then
shifted to the permissive temperature of 32.5°C for 24 h prior to the preparation
of the mRNA. The method of the invention was conducted using the primers
"Kozak" and one of five arbitrary sequence primers, "AP-1, AP-2, AP-3, AP-4,
or AP-5", as the second and first primers, respectively.

The sequence of the "Kozak" primer was chosen based upon the published
consensus sequence for the translation start site of mRNAs
(Kozak, 1991, Jour. Cell Biology, Vol. 115, pp. 887-903). Degenerate Kozak
primers having sequences substantially identical to the translation start site
consensus sequence were used simultaneously. These sequences were
5'-GCCRCCATGG [Seq. ID No. 12], in which the R is dA or dG; each
oligodeoxynucleotide thus has only one of the given nucleotides at this
position, which results in a mixture of primers.
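
To make the primer mixture concrete, a degenerate sequence can be expanded
into the individual oligodeoxynucleotides it represents. This is a minimal
sketch: the function name `expand` is hypothetical, and the table covers only
the ambiguity codes used in this document (R, Y, V and N, with N read here as
any of the four bases).

```python
from itertools import product

# Ambiguity codes used in this document; plain bases map to themselves.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG",    # dA or dG
    "Y": "CT",    # dC or dT
    "V": "ACG",   # dA, dC or dG
    "N": "ACGT",  # any base
}

def expand(primer: str) -> list[str]:
    """List every unambiguous sequence encoded by a degenerate primer."""
    return ["".join(bases) for bases in product(*(IUPAC[b] for b in primer))]

kozak_mixture = expand("GCCRCCATGG")
print(kozak_mixture)
```

For the degenerate Kozak primer this yields the two sequences GCCACCATGG and
GCCGCCATGG.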

The sequences of the five arbitrary primers were as follows: AP-1 had the
sequence 5'-AGCCAGCGAA [Seq. ID. No. 13]; AP-2 had the sequence
5'-GACCGCTTGT [Seq. ID. No. 14]; AP-3 had the sequence
5'-AGGTGACCGT [Seq. ID. No. 15]; AP-4 had the sequence 5'-GGTACTCCAC
[Seq. ID. No. 16]; and AP-5 had the sequence 5'-GTTGCGATCC [Seq. ID. No.
17]. These arbitrary sequence primers were chosen arbitrarily. In general each
arbitrary sequence primer was chosen to have a GC content of 50-70%.
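
The GC content criterion can be verified directly from the five sequences. The
snippet below is an illustrative sketch with hypothetical names; it simply
counts dG and dC residues in each 10-mer.

```python
AP_PRIMERS = {
    "AP-1": "AGCCAGCGAA",  # Seq. ID. No. 13
    "AP-2": "GACCGCTTGT",  # Seq. ID. No. 14
    "AP-3": "AGGTGACCGT",  # Seq. ID. No. 15
    "AP-4": "GGTACTCCAC",  # Seq. ID. No. 16
    "AP-5": "GTTGCGATCC",  # Seq. ID. No. 17
}

def gc_percent(seq: str) -> float:
    """Percentage of G and C residues in a sequence."""
    return 100 * sum(base in "GC" for base in seq) / len(seq)

gc = {name: gc_percent(seq) for name, seq in AP_PRIMERS.items()}
for name, value in gc.items():
    print(f"{name}: {value:.0f}% GC")
```

Each of the five primers contains six G or C residues, i.e. 60% GC, which lies
within the stated 50-70% window.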

The mRNA was reverse transcribed using one of the AP primers as the
first primer, and the resultant first cDNA strand was amplified in the presence of
both primers, the AP primer and the degenerate Kozak primer, by PCR using 2
µM dNTPs and the PCR parameters described above. The PCR products were
separated on a DNA sequencing gel. At least 50-100 amplified cDNA bands
were present in each of the cell lines tested, and some bands were expressed at
different levels in the different cell lines. As a control, a reaction was conducted
using each arbitrary primer in the absence of the Kozak primer. No cDNA was
generated by the arbitrary primer alone, thus demonstrating that both primers
were required to amplify an mRNA into a cDNA.

With reference now to Fig. 4, the primer sets used for each reaction are
shown at the top of the figure, along the line marked Primers. As a control, a
reaction was conducted using the primers in the absence of mRNA, and using
AP-1 with mRNA in the absence of the Kozak primer. No cDNA was generated
by the primers in the absence of mRNA or by the arbitrary primer alone, thus
demonstrating that mRNA is required for amplification and that both primers
were required to amplify an mRNA into a cDNA. The cDNA products of the
amplification were loaded in the same order across the gel; thus the REF cell line
is shown in each of lanes 1, cell line T101-4 is shown in each of lanes 2, cell
line A1-5 cultured at 37°C is shown in each of lanes 3, and cell line A1-5
cultured at 32.5°C is shown in each of lanes 4. Each pair of primers resulted in
the amplification of a different set of mRNAs from the cell lines. The reactions
which were conducted using the Kozak primer and any of primers AP-1, AP-2,
AP-4, or AP-5 as a primer set resulted in the amplification of the same cDNA
pattern from each of cell lines REF, T101-4, A1-5 cultured at 37°C and A1-5
cultured at 32.5°C. The amplification of mRNA from each cell line and
temperature using the Kozak degenerate primer and the AP-3 primer resulted in
the finding of one band in particular which was present in the mRNA prepared
from the A1-5 cell line when cultured at 32.5°C for 24 h, and not in any of the
other mRNA preparations, as can be seen in Fig. 4 designated as Ki. Thus the
method according to the invention may be used to identify genes which are
differentially expressed in mutant cell lines.
Cloning of the mRNA identified in Example 4

The cDNA ("Ki") that was expressed only in the A1-5 cell line when
cultured at 32.5°C was recovered from the DNA sequencing gel and reamplified
using the primers Kozak and AP-3 as described above. The reamplified Ki
cDNA was confirmed to have the appropriate size of approximately 450 bp, and
was cloned with the TA cloning system, Invitrogen Inc., into the vector pCRII
(Invitrogen, Inc.) according to the manufacturer's instructions, and sequenced.
With reference now to Fig. 5, the nucleotide sequence clearly shows the Ki
clone to be flanked by the underlined Kozak primer 20 at the 5' end and the
underlined AP-3 primer 21 at the 3' end as expected. The 5' end of this partial
cDNA is identified in Seq. ID No. 18, and the 3' end of this cDNA is identified
in Seq. ID No. 19. This partial sequence is an open reading frame, and a search
of the gene databases EMBO and Genbank has revealed the translated amino acid
sequence from the 3' portion of Ki to be homologous to the ubiquitin conjugating
enzyme family (UBC enzyme). The translated amino acid sequence of the 3'
portion of Ki is 100% identical to a UBC enzyme from D. melanogaster and
75% identical to the UBC-4 enzyme and 79% identical to the UBC-5 enzyme
from the yeast S. saccharomyces; and 75% identical to the UBC enzyme from
Arabidopsis thaliana. The Ki clone may contain the actual 5' end of this gene;
otherwise the Kozak primer hybridized just after the 5' end. This result
demonstrates that the method according to the invention can be used to clone the
5' coding sequence of a gene.
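
As an illustration of the open reading frame at the 5' end of the Ki clone,
Seq. ID No. 18 can be translated from the ATG contained in the Kozak primer
site. This is only a sketch using the standard genetic code; the helper names
are hypothetical, and the source does not state which reading frame was used
for the database search.

```python
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Standard genetic code, built from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(seq: str) -> str:
    """Translate complete codons of a DNA sequence, ignoring a trailing partial codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# Seq. ID No. 18 (5' end of the Ki cDNA), as given in the sequence listing.
ki_5prime = "GCCGCCATGG" "CTCTGAAGAG" "AATCCACAAG" "GACACCCATG" "AA"

start = ki_5prime.find("ATG")          # ATG within the Kozak primer site
peptide = translate(ki_5prime[start:])
print(start, peptide)
```

The twelve codons following the primer-derived ATG contain no stop codon,
which is consistent with the description of this partial sequence as an open
reading frame.
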
Use

The method according to the invention can be used to identify, isolate and
clone mRNAs from any number of sources. The method provides for the
identification of desirable mRNAs by simple visual inspection after separation,
and can be used for investigative research, industrial and medical applications.
For instance, the reamplified cDNAs can be sequenced, or used to screen a
DNA library in order to obtain the full length gene. Once the sequence of the
cDNA is known, amino acid peptides can be made from the translated protein
sequence and used to raise antibodies. These antibodies can be used for further
research of the gene product and its function, or can be applied to medical
diagnosis and prognosis. The reamplified cDNAs can be cloned into an
appropriate vector for further propagation, or cloned into an appropriate
expression vector in order to be expressed, either in vitro or in vivo. The
cDNAs which have been cloned into expression vectors can be used in industrial
situations for overproduction of the protein product. In other applications the
reamplified cDNAs or their respective clones will be used as probes for in situ
hybridization. Such probes can also be used for the diagnosis or prognosis of
disease.

Other Embodiments 
Other embodiments are within the following claims. 
The length of the oligodeoxynucleotide can be varied dependent upon the
annealing temperature chosen. In the preferred embodiments the temperature
was chosen to be 42°C and the oligonucleotide primers were chosen to be at
least 9 nucleotides in length. If the annealing temperature were decreased to
35°C then the oligonucleotide lengths can be decreased to at least 6 nucleotides
in length.

The cDNA could be radiolabelled with radioactive nucleotides other than
35S, such as 32P and 33P. When desired, non-radioactive imaging methods can
also be applied to the method according to the invention.

The amplification of the cDNA could be accomplished by a temperature
cycling polymerase chain reaction, as was described, using a heat stable DNA
polymerase for the repetitive copying of the cDNA while cycling the temperature
for continuous rounds of denaturation, annealing and extension. Alternatively, the
amplification could be accomplished by an isothermal DNA amplification method
(Walker et al., 1992, Proc. Natl. Acad. Sci., Vol. 89, pp. 392-396). The
isothermal amplification method would be adapted for use in amplifying cDNA
by including an appropriate restriction endonuclease sequence, one that will be
nicked at hemiphosphorothioate recognition sites and whose recognition site can
be regenerated during synthesis with α-35S labelled dNTPs.

Proteins having similar function or similar functional domains are often
referred to as being part of a gene family. Many such proteins have been cloned
and identified to contain consensus sequences which are highly conserved
amongst the members of the family. This conservation of sequence can be used
to design oligodeoxynucleotide primers for the cloning of new members, or
related members, of a family. Using the method of the invention the mRNA
from a cell can be reverse transcribed, and a cDNA could be amplified using at
least one primer that has a sequence substantially identical to the sequence of a
mRNA of known sequence. Consensus sequences for at least the following
families and functional domains have been described in the literature: protein
tyrosine kinases (Hanks et al., 1991, Methods in Enzymology, Vol. 200, pp. 38-81;
Wilks, 1991, Methods in Enzymology, Vol. 200, pp. 533-546); homeobox
genes; zinc-finger DNA binding proteins (Miller et al., 1985, EMBO Jour., Vol.
4, pp. 1609-1614); receptor proteins; the signal peptide sequence of secreted
proteins; proteins that localize to the nucleus (Guiochon-Mantel et al., 1989,
Cell, Vol. 57, pp. 1147-1154); serine proteases; inhibitors of serine proteases;
cytokines; the SH2 and SH3 domains that have been described in tyrosine kinases
and other proteins (Pawson et al., 1992, Cell, Vol. 71, pp. 359-362);
serine/threonine and tyrosine phosphatases (Cohen, 1991, Methods in
Enzymology, Vol. 201, pp. 398-408); cyclins and cyclin-dependent protein
kinases (CDKs) (see for ex., Keyomarsi et al., 1993, Proc. Natl. Acad. Sci.,
USA, Vol. 90, pp. 1112-1116).

Primers for any consensus sequence can readily be designed based upon the
codon usage of the amino acids. The incorporation of degeneracy at one or more
sites allows the designing of a primer which will hybridize to a high percentage,
greater than 50%, of the mRNAs containing the desired consensus sequence.

Primers for use in the method according to the invention could be designed
based upon the consensus sequence of the zinc finger DNA binding proteins, for
example, based upon the amino acid consensus sequence of the proteins PYVC.
Useful primers for the cloning of further members of this family can have the
following sequences: 5'-GTAYGCNTGT [Seq. ID. No. 20] or
5'-GTAYGCNTGC [Seq. ID. No. 21], in which the Y refers to the
deoxynucleotides dT or dC for which the primer is degenerate at this position,
and the N refers to inosine ("I"). The base inosine can pair with all of the
other bases, and was chosen for this position of the oligodeoxynucleotide as the
codon for valine ("V") is highly degenerate in this position. The described
oligodeoxynucleotide primers as used will be a mixture of 5'-GTATGCITGT
and 5'-GTACGCITGT or a mixture of 5'-GTATGCITGC and
5'-GTACGCITGC.

SEQUENCE LISTING

(1) GENERAL INFORMATION:
    (i) APPLICANT: Liang, Peng
                   Pardee, Arthur B.
    (ii) TITLE OF INVENTION: Identifying, Isolating and Cloning Messenger RNAs
    (iii) NUMBER OF SEQUENCES: 21
    (iv) CORRESPONDENCE ADDRESS:
        (A) ADDRESSEE: Choate, Hall & Stewart
        (B) STREET: Exchange Place, 53 State Street
        (C) CITY: Boston
        (D) STATE: Massachusetts
        (E) COUNTRY: U.S.A.
        (F) ZIP: 02190
    (v) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS
        (D) SOFTWARE: PatentIn Release #1.0, Version #1.25
    (vi) CURRENT APPLICATION DATA:
        (A) APPLICATION NUMBER: US
        (B) FILING DATE: 11-MAR-1993
        (C) CLASSIFICATION:
    (vii) PRIOR APPLICATION DATA:
        (A) APPLICATION NUMBER: US 07/850,343
        (B) FILING DATE: 11-MAR-1992
    (viii) ATTORNEY/AGENT INFORMATION:
        (A) NAME: Pasternack, Sam
        (B) REGISTRATION NUMBER: 29,576
        (C) REFERENCE/DOCKET NUMBER: DFCI234CIP
    (ix) TELECOMMUNICATION INFORMATION:
        (A) TELEPHONE: 617 227-5020
        (B) TELEFAX: 617 227-7566

(2) INFORMATION FOR SEQ ID NO: 1:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 13 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
TTTTTTTTTT TVN 13

(2) INFORMATION FOR SEQ ID NO: 2:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 13 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

TTTTTTTTTT TTV

(2) INFORMATION FOR SEQ ID NO: 3:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 13 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TTTTTTTTTT VNN

(2) INFORMATION FOR SEQ ID NO: 4:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
NNTTTATTNN

(2) INFORMATION FOR SEQ ID NO: 5:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
GCTTTATTNC

(2) INFORMATION FOR SEQ ID NO: 6:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
NNNANNATGN

(2) INFORMATION FOR SEQ ID NO: 7:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
NNNANNATGG

(2) INFORMATION FOR SEQ ID NO: 8:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
GCCACCATGG

(2) INFORMATION FOR SEQ ID NO: 9:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 260 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
CTTGATTGCC TCCTACAGCA GTTGCAGGCA CCTTTAGCTG TACCATGAAG TTCACAGTCC 60
GGGATTGTGA CCCTAATACT GGAGTTCCAG ATGAAGATGG ATATGATGAT GAATATGTGC 120
TGGAAGATCT TGAGGTAACT GTGTCTGATC ATATTCAGAA GATACTAAAA CCTAACTTCG 180
CTGCTGCCTG GGAAGAGGTG GGAGGAGCAG CTGCGACAGA GCGTCCTCTT CACAGAGGGG 240
TCCTGGGTGA AAAAAAAAAA 260

(2) INFORMATION FOR SEQ ID NO: 10:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 13 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:
TTTTTTTTTT TCA

(2) INFORMATION FOR SEQ ID NO: 11:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
CTTGATTGCC

(2) INFORMATION FOR SEQ ID NO: 12:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
GCCRCCATGG

(2) INFORMATION FOR SEQ ID NO: 13:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:
AGCCAGCGAA

(2) INFORMATION FOR SEQ ID NO: 14:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:
GACCGCTTGT

(2) INFORMATION FOR SEQ ID NO: 15:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
AGGTGACCGT

(2) INFORMATION FOR SEQ ID NO: 16:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:
GGTACTCCAC

(2) INFORMATION FOR SEQ ID NO: 17:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GTTGCGATCC

(2) INFORMATION FOR SEQ ID NO: 18:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 42 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
GCCGCCATGG CTCTGAAGAG AATCCACAAG GACACCCATG AA 42

(2) INFORMATION FOR SEQ ID NO: 19:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 78 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: cDNA
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
GTTGCATTTA CAACAAGAAT TTATCATCCA AATATTAACA GTAATGGCAG CATTTGTCTT 60
GATATTCTAC GGTCACCT 78

(2) INFORMATION FOR SEQ ID NO: 20:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
GTAYGCNTGT 10

(2) INFORMATION FOR SEQ ID NO: 21:
    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 10 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear
    (ii) MOLECULE TYPE: other nucleic acid
    (iii) HYPOTHETICAL: NO
    (iv) ANTI-SENSE: NO
    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
GTAYGCNTGC 10

Claims

1. A non-specific cloning method for isolating in a nucleic acid sample a 
DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide under conditions in 
which said first oligodeoxynucleotide hybridizes with mRNA at a site including a 
sequence immediately upstream of a first A ribonucleotide of the mRNA's polyA 
tail, 

reverse transcribing the mRNA using a reverse transcriptase, using said
first oligodeoxynucleotide as a primer, to produce a first DNA strand
complementary to at least a portion of the mRNA upstream from the site of
hybridization of said first oligodeoxynucleotide with the mRNA,

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with DNA, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from the site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a DNA polymerase, 
using said first and second oligodeoxynucleotides as primers. 

2. The method of claim 1 wherein said first oligodeoxynucleotide 
hybridizes with the mRNA at a site that includes at least one base upstream from 
and adjacent to the first A ribonucleotide of the polyA tail. 

3. The method of claim 2 wherein said first oligodeoxynucleotide
hybridizes with the mRNA at a site that includes at least two bases upstream
from and adjacent to the first A ribonucleotide of the polyA tail.

4. The method of claim 1 wherein said first oligodeoxynucleotide
includes a polyA-complementary region comprising at least 11 bases and,
upstream from said polyA-complementary region, a non-polyA-complementary
region comprising at least one base.

5. The method of claim 4 wherein said non-polyA-complementary
region comprises at least 2 contiguous bases.

6. The method of claim 5 wherein said non-polyA-complementary
region comprises 3'-NV, wherein V is one of dA, dC or dG, and N is one of
dA, dT, dC or dG.

7. The method of claim 4 wherein said first oligodeoxynucleotide
comprises at least 13 bases. 

8. The method of claim 1 wherein said second oligodeoxynucleotide 
comprises at least 6 deoxyribonucleotides. 

9. The method of claim 1 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

10. The method of claim 1 wherein said second oligodeoxynucleotide 
includes a randomly selected nucleotide sequence. 

11. The method of claim 1 wherein said first or second
oligodeoxynucleotide includes a selected arbitrary sequence.

12. The method of claim 1 wherein said first or the second
oligodeoxynucleotide includes dC, dG, dT and dA.

13. The method of claim 1 wherein said first or second
oligodeoxynucleotide includes a restriction endonuclease recognition sequence.

14. The method of claim 1 wherein said second oligodeoxynucleotide
includes a sequence identical to a sequence contained within a mRNA of known
sequence.

15. The method of claim 1 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

16. A non-specific cloning method for isolating in a nucleic acid sample a
DNA complementary to a mRNA, comprising

contacting the mRNA with a first oligodeoxynucleotide under conditions in
which said first oligodeoxynucleotide hybridizes with mRNA at a site that
includes the mRNA's polyA signal sequence,

reverse transcribing the mRNA using a reverse transcriptase, using said
first oligodeoxynucleotide as a primer, to produce a first DNA strand
complementary to at least a portion of the mRNA upstream from the site of
hybridization of said first oligodeoxynucleotide with the mRNA,

contacting the first DNA strand with a second oligodeoxynucleotide under
conditions in which said second oligodeoxynucleotide hybridizes with DNA,

extending the second oligodeoxynucleotide using a DNA polymerase to
produce a second DNA strand complementary to the first DNA strand
downstream from the site of hybridization of said second oligodeoxynucleotide
with said first DNA strand, and

amplifying the first and second DNA strands using a DNA polymerase,
using said first and second oligodeoxynucleotides as primers.

17. The method of claim 16 wherein said first oligodeoxynucleotide 
comprises at least 6 nucleotides. 

18. The method of claim 16 wherein said first oligodeoxynucleotide 
comprises at least 9 nucleotides. 

19. The method of claim 16 wherein said second oligodeoxynucleotide 
comprises at least 6 deoxyribonucleotides. 

20. The method of claim 16 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

21. The method of claim 16 wherein said second oligodeoxynucleotide 
includes a randomly selected nucleotide sequence. 

22. The method of claim 16 wherein said first or second 
oligodeoxynucleotide includes a selected arbitrary sequence. 

23. The method of claim 16 wherein said first or the second 
oligodeoxynucleotide includes dC, dG, dT and dA. 

24. The method of claim 16 wherein said first or second 
oligodeoxynucleotide includes a restriction endonuclease recognition sequence. 

25. The method of claim 16 wherein said second oligodeoxynucleotide 
includes a sequence identical to a sequence contained within a mRNA of known 
sequence. 

26. The method of claim 16 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

27. A method for isolating in a nucleic acid sample a DNA 
complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide under conditions in 
which said first oligodeoxynucleotide hybridizes with mRNA at a site, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with the DNA 
strand at a site, said site including a Kozak sequence, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from the site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

28. The method of claim 27 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

29. The method of claim 27 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

30. The method of claim 27 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

31. The method of claim 27 wherein said second oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

32. The method of claim 27 wherein said first oligodeoxynucleotide is 
composed of a randomly selected sequence of deoxyribonucleotides. 

33. The method of claim 27 wherein said first oligodeoxynucleotide 
includes a selected arbitrary sequence of deoxyribonucleotides. 

34. The method of claim 27 wherein said first oligodeoxynucleotide 
includes a restriction endonuclease recognition sequence. 

35. The method of claim 27 wherein said first oligodeoxynucleotide 
includes a sequence substantially identical to a sequence contained within an 
mRNA of known sequence. 

36. The method of claim 27 wherein said second oligodeoxynucleotide 
further includes a restriction endonuclease sequence. 

37. The method of claim 27 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

38. A non-specific cloning method for isolating in a nucleic acid sample 
a DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide, having a base 
sequence substantially complementary to a sequence in a mRNA of known 
sequence, under conditions in which said first oligodeoxynucleotide hybridizes 
with mRNA at a site having said substantially identical sequence, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with the DNA 
strand at a site, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from said site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

39. The method of claim 38 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

40. The method of claim 38 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

41. The method of claim 38 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

42. The method of claim 38 wherein said second oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

43. The method of claim 38 wherein said first oligodeoxynucleotide 
further includes a restriction endonuclease sequence. 

44. The method of claim 38 wherein said second oligodeoxynucleotide 
is composed of a randomly selected sequence of deoxyribonucleotides. 

45. The method of claim 38 wherein said second oligodeoxynucleotide 
includes a selected arbitrary sequence of deoxyribonucleotides. 

46. The method of claim 38 wherein the base sequence of said second 
oligodeoxynucleotide contains a restriction endonuclease recognition sequence. 

47. The method of claim 38 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

48. A non-specific cloning method for isolating in a nucleic acid sample 
a DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide under conditions in 
which said first oligodeoxynucleotide hybridizes with mRNA at a site, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide, having 
a sequence substantially identical to a sequence in a mRNA of known sequence, 
under conditions in which said second oligodeoxynucleotide hybridizes with the 
first DNA strand at a site containing a complement of said substantially identical 
sequence, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from said site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

49. The method of claim 48 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

50. The method of claim 48 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

51. The method of claim 48 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

52. The method of claim 48 wherein said second oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

53. The method of claim 48 wherein said first oligodeoxynucleotide is 
composed of a randomly selected sequence of deoxyribonucleotides. 

54. The method of claim 48 wherein said first oligodeoxynucleotide 
includes a selected arbitrary sequence of deoxyribonucleotides. 

55. The method of claim 48 wherein said first oligodeoxynucleotide 
contains a restriction endonuclease recognition sequence. 

56. The method of claim 48 wherein said second oligodeoxynucleotide 
further includes a restriction endonuclease sequence. 

57. The method of claim 48 wherein at least one of said first or second 
oligodeoxynucleotides comprises a plurality of oligodeoxynucleotides. 

58. A non-specific cloning method for isolating in a nucleic acid sample 
a DNA complementary to a mRNA, comprising 

contacting the mRNA with a first oligodeoxynucleotide, having a sequence 
substantially complementary to a sequence in a mRNA of known sequence, under 
conditions in which said first oligodeoxynucleotide hybridizes with mRNA at a 
site containing said substantially identical sequence, 

reverse transcribing the mRNA using a reverse transcriptase, using said 
first oligodeoxynucleotide as a primer, to produce a first DNA strand 
complementary to at least a portion of the mRNA upstream from said site of 
hybridization of said first oligodeoxynucleotide with the mRNA, 

contacting the first DNA strand with a second oligodeoxynucleotide under 
conditions in which said second oligodeoxynucleotide hybridizes with the DNA 
strand at a site, said site including a Kozak sequence, 

extending the second oligodeoxynucleotide using a DNA polymerase to 
produce a second DNA strand complementary to the first DNA strand 
downstream from said site of hybridization of said second oligodeoxynucleotide 
with said first DNA strand, and 

amplifying the first and second DNA strands using a polymerase, using 
said first and second oligodeoxynucleotides as primers. 

59. The method of claim 58 wherein said first oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

60. The method of claim 58 wherein said first oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 

61. The method of claim 58 wherein said second oligodeoxynucleotide 
comprises at least 9 deoxyribonucleotides. 

62. The method of claim 58 wherein said second oligodeoxynucleotide 
comprises 10 deoxyribonucleotides. 